Recognition Of Voice Using Mel Cepstral Coefficient & Vector Quantization
نویسندگان
چکیده
Human Voice is characteristic for an individual. The ability to recognize the speaker by his/her voice can be a valuable biometric tool with enormous commercial as well as academic potential. Commercially, it can be utilized for ensuring secure access to any system. Academically, it can shed light on the speech processing abilities of the brain as well as speech mechanism. In fact, this feature is being used preliminarily along with other biometrics including face and finger print recognition for commercial security products. Speaker recognition is the method of automatically identify who is speaking on the basis of individual information integrated in speech waves. There are two types of speaker recognition systems basically divided into two – classification: speaker identification and speaker verification. Speaker identification determines from which of the registered speakers a given utterance comes whereas speaker verification is the process of accepting or rejecting the claimed identity of a speaker .The fundamental difference between identification & verification modes is the number of decision alternatives. In the Identification mode the number of decision alternatives is equal to the size of the population, whereas in the verification mode there are only two alternatives, accept or reject the Identification claim, regardless of the size of population. Most applications of speaker recognition are actually speaker verifications. Speaker Recognition is of two types :Text Based and Text independent. In Text based approach, the speaker is identified by the utterance of some fixed piece of text while in the text independent approach the speaker is allowed to utter any text whatsoever.
منابع مشابه
Speaker Recognition using MFCC and Improved Weighted Vector Quantization Algorithm
Speaker recognition is one of the most essential tasks in the signal processing which identifies a person from characteristics of voices . In this paper we accomplish speaker recognition using Mel-frequency Cepstral Coefficient (MFCC) with Weighted Vector Quantization algorithm. By using MFCC, the feature extraction process is carried out. It is one of the nonlinear cepstral coefficient functio...
متن کاملSpeaker Identification and Verification using Vector Quantization and Mel Frequency Cepstral Coefficients
In the study of speaker recognition, Mel Frequency Cepstral Coefficient (MFCC) method is the best and most popular which is used to feature extraction. Further vector quantization technique is used to minimize the amount of data to be handled in recent years. In the present study, the Speaker Recognition using Mel Frequency Cepstral coefficients and vector Quantization for the letter “Zha” (in ...
متن کاملVoice-based Age and Gender Recognition using Training Generative Sparse Model
Abstract: Gender recognition and age detection are important problems in telephone speech processing to investigate the identity of an individual using voice characteristics. In this paper a new gender and age recognition system is introduced based on generative incoherent models learned using sparse non-negative matrix factorization and atom correction post-processing method. Similar to genera...
متن کاملSpeaker Identification Based on Vector Quantization
In this paper a method of text-independent speaker recognition using discrete vector quantization is presented. The identification experiments were performed in a closed set of 599 speakers and two various types of features were tested: cepstral mean subtraction coefficients and mel-frequency cepstral coefficients. The effect of the various codebook size on the speaker identification performanc...
متن کاملA Vector Quantization Approach for Voice Recognition Using Mel Frequency Cepstral Coefficient (MFCC): A Review
This paper presents a brief survey on Automatic Voice Recognition so as to provide a technological perspective and an appreciation of the fundamental progress that has been accomplished in area of voice communication. The voice is a signal of infinite information. After years of research and development the accuracy of automatic voice recognition remains one of the important research challenges...
متن کاملComparative Study of MFCC And LPC Algorithms for Gujrati Isolated Word Recognition
The study performs feature extraction for isolated word recognition using Mel-Frequency Cepstral Coefficient (MFCC) for Gujarati language. It explains feature extraction methods MFCC and Linear Predictive Coding (LPC) in brief. The paper compares the performances of MFCC and LPC features under Vector Quantization (VQ) method. The dataset comprising of males and females voices were trained and t...
متن کامل